Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
3685 | 360 | 1,5 |
5143 | 257 | 2,5 |
5582 | 234 | 1,2 |
6546 | 196 | 3,5 |
7489 | 168 | 0,2% |
7809 | 160 | 0,3% |
7856 | 159 | 0,5% |
7901 | 158 | 0,1% |
8080 | 154 | 1,3 |
8401 | 147 | 1,6 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
67398 | 7 | 7-6(5 |
73387 | 6 | 7-6(3 |
80697 | 5 | 0(zero |
91326 | 4 | 6-7(5 |
91374 | 4 | 7-6(2 |
101764 | 4 | partid(e |
106054 | 3 | 6-7(2 |
109002 | 3 | Gade(a |
111459 | 3 | Po(a)nta |
129488 | 2 | 3(trei |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
17517 | 58 | n.r.). |
47067 | 13 | %) |
57357 | 10 | n.red.). |
67172 | 7 | %). |
67294 | 7 | 21:45)/ |
70395 | 7 | etc)? |
98683 | 4 | etc.)? |
111459 | 3 | Po(a)nta |
119324 | 3 | i)logica |
121701 | 3 | n.r). |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
1423 | 930 | 50% |
1513 | 871 | 10% |
1696 | 770 | 70% |
1916 | 693 | 30% |
1983 | 669 | 20% |
2471 | 538 | 5% |
2656 | 503 | 25% |
2717 | 492 | 15% |
2894 | 462 | 80% |
2992 | 447 | 100% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
7692 | 163 | RCS&RDS |
10166 | 116 | S&P |
21952 | 42 | IT&C |
27687 | 30 | Ernst&Young |
33929 | 22 | H&M |
34015 | 22 | Q&A |
36033 | 20 | AT&T |
47670 | 13 | Saatchi&Saatchi |
52638 | 11 | R&D |
52639 | 11 | R&S |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
51917 | 11 | $. |
54880 | 10 | $, |
91053 | 4 | 200$ |
105150 | 3 | 1$ |
105261 | 3 | 1000$ |
109882 | 3 | Ke$ha |
127958 | 2 | $! |
127959 | 2 | $$ |
127960 | 2 | $1.4 |
127961 | 2 | $1500 |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
127967 | 2 | %" |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
14272 | 76 | dom'le |
21261 | 44 | McDonald's |
21967 | 42 | Moody's |
24447 | 36 | Poor's |
33942 | 22 | It's |
35062 | 21 | O'Sullivan |
36518 | 20 | d'Or |
41657 | 16 | Dom'le |
45981 | 14 | d'aia |
52181 | 11 | Christie's |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
177146 | 1 | 16%+10% |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
184248 | 1 | 60%*40% |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
7021 | 181 | km/h |
7097 | 179 | si/sau |
12995 | 86 | lei/euro |
14053 | 77 | 6/49 |
21197 | 44 | 109/2011 |
24190 | 37 | l/mp |
24660 | 36 | lei/MWh |
28666 | 29 | lei/actiune |
29311 | 28 | euro/luna |
31518 | 25 | copy/paste |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters occur with more than a very low frequency, this may indicate a problem in preprocessing.
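
As a rough illustration of this check, the sketch below flags result rows whose word contains a forbidden character and whose frequency is not very low. The function name `flag_suspicious`, the choice of forbidden characters, and the threshold `min_freq=10` are assumptions made for this example only; a sensible threshold depends on the corpus size and the language.

```python
def flag_suspicious(rows, forbidden, min_freq=10):
    """Return (rank, freq, word) rows that contain a forbidden character
    and occur at least min_freq times (hypothetical threshold)."""
    return [
        (rank, freq, word)
        for rank, freq, word in rows
        if freq >= min_freq and any(ch in forbidden for ch in word)
    ]


# A few rows taken from the result tables of this subsection.
rows = [
    (7021, 181, "km/h"),
    (17517, 58, "n.r.)."),
    (177146, 1, "16%+10%"),
]

# Assumed set of characters that should not occur inside words.
suspicious = flag_suspicious(rows, forbidden={"(", ")", "+"})
print(suspicious)
```

With these inputs only `n.r.).` is reported: `km/h` contains no forbidden character, and `16%+10%` occurs just once, which is below the assumed threshold.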
Words containing % (the literal % is escaped as \% because % is itself a LIKE wildcard):
select w_id - 100, freq, word from words where w_id > 100 and word like "%\%%" limit 10;
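
The same query can be repeated for each of the listed characters. The following is a minimal sketch of that loop, not the tooling actually used for this report. It assumes an SQLite in-memory database instead of the original one, so the escape character has to be declared explicitly with `ESCAPE`. The helper names `like_pattern`, `words_containing` and `build_demo_db` are hypothetical, and the `ORDER BY w_id` clause is an added assumption that `w_id` follows the frequency rank. The demo rows are copied from the result tables of this subsection, with `w_id = rank + 100` as the query implies.

```python
import sqlite3

# The special characters examined in this subsection.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch):
    """Return a LIKE pattern for words containing ch.

    % and _ are LIKE wildcards, so they (and the escape character
    itself) are prefixed with a backslash.
    """
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Return up to `limit` (rank, freq, word) rows for words containing ch."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


def build_demo_db():
    """Create an in-memory words table with a few rows (w_id = rank + 100)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
    )
    rows = [
        (1523, "50%", 930),
        (1613, "10%", 871),
        (7121, "km/h", 181),
        (10266, "S&P", 116),
    ]
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    for ch in SPECIAL_CHARS:
        print(repr(ch), words_containing(conn, ch))
```

The underscore is the case where escaping matters most: an unescaped `_` matches any single character, so the pattern `%_%` would return every non-empty word rather than only words containing an underscore.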